CDS

Accession Number TCMCG036C12401
gbkey CDS
Protein Id PTQ38391.1
Location complement(join(285977..286036,287139..287651,287817..287944,288434..288530,288762..289181,289448..289669,290976..291250,291366..291495,291899..292125,292261..292308,292603..292636,292769..292924,293068..293163,293337..293406,293520..293575,293826..293921,294181..294825))
GeneID Phytozome:Mapoly0051s0023
Organism Marchantia polymorpha
locus_tag MARPO_0051s0023
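
The Location field above uses GenBank-style syntax: `complement(...)` marks the reverse strand and `join(...)` lists exon intervals as 1-based, inclusive `start..end` pairs. The following is a minimal sketch, not part of the original record, of how the spliced CDS length could be derived from that string. The helper names `parse_location` and `spliced_length` are hypothetical, and the regex assumes a simple location with no partial-end markers (`<`, `>`) or remote accessions.

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "complement(join(285977..286036,287139..287651,287817..287944,"
    "288434..288530,288762..289181,289448..289669,290976..291250,"
    "291366..291495,291899..292125,292261..292308,292603..292636,"
    "292769..292924,293068..293163,293337..293406,293520..293575,"
    "293826..293921,294181..294825))"
)


def parse_location(location: str):
    """Return (strand, exons) for a simple GenBank-style location (hypothetical helper)."""
    strand = "-" if location.startswith("complement(") else "+"
    # Each exon is written as start..end with 1-based, inclusive coordinates.
    exons = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, exons


def spliced_length(exons):
    """Total number of bases covered by the joined intervals."""
    return sum(end - start + 1 for start, end in exons)


strand, exons = parse_location(LOCATION)
total = spliced_length(exons)
print(strand, len(exons), total)  # - 17 3273
```

The 3273 bases correspond to 1091 codons, which agrees with the 1090 aa protein length given in this record plus one stop codon.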

Protein

Length 1090aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772723.1
Definition hypothetical protein MARPO_0051s0023 [Marchantia polymorpha]
Locus_tag MARPO_0051s0023

EGGNOG-MAPPER Annotation

COG_category L
Description DNA mismatch repair protein
KEGG_TC -
KEGG_Module M00295
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
ko00001
ko00002
ko03400
KEGG_ko ko:K08737
EC -
KEGG_Pathway ko01524
ko03430
ko05200
ko05210
map01524
map03430
map05200
map05210
GOs GO:0000217
GO:0000404
GO:0003674
GO:0003676
GO:0003677
GO:0003684
GO:0003690
GO:0003824
GO:0005488
GO:0005575
GO:0005622
GO:0005623
GO:0006139
GO:0006259
GO:0006281
GO:0006298
GO:0006725
GO:0006807
GO:0006950
GO:0006974
GO:0006996
GO:0008094
GO:0008150
GO:0008152
GO:0009987
GO:0016043
GO:0016462
GO:0016787
GO:0016817
GO:0016818
GO:0016887
GO:0017111
GO:0030983
GO:0032135
GO:0032300
GO:0032991
GO:0033554
GO:0034641
GO:0042623
GO:0043170
GO:0043570
GO:0044237
GO:0044238
GO:0044260
GO:0044424
GO:0044464
GO:0046483
GO:0050896
GO:0051276
GO:0051716
GO:0071704
GO:0071840
GO:0090304
GO:0097159
GO:1901360
GO:1901363
GO:1990391

Sequence

CDS:  
ATGTCGCAGCAACAGTCTCTCTTCTCGTTTTATTCTAGAGGAGGGGCTGCGGCGAAACCGAAAGCAAATTCGCATCGATCGGATTTTTCGGTTGTGAAGCATGAACAACGTGAAGTCGCGGGTTGCAGAGAAGGGTATCATCTCGAGCAGGGAAGAAAGAAACTGGACTCTGCGTCGCAGGGGCAGAGTCTTTCGACGCAGAACGTCTTGCAGAGCATAGAGGAAAGATTTGTTCGTAGGGAGAAAGCCGTGAAGCCGATTGGGATCGTCACGACTGAGTCTAGAGAAACTAGCCTTGATCCTACAGGAAACAAGCGCATTCGTGAAACGGAGGAGAAATTGGTGTGTCCAGTTCCGTCCATTGCTCAAAATCAGAATTTGCAAAAGAGAAGAAAGCTTCTAGAGGCATTGGGAGCGGATGATGGTGTCGAGACGACGGGAGCCTGGGCGGAAGCCAGAGCTAAGTTCGAATGGATGACGCCGTCGAAGATTAGGGATGCTGAAAAGAGGAGGCCCTGCCACCCTGCTTACGACGAGAGAACTCTTCACATACCGCATGATGTTTTCCACAAGTTTTCTGCTTCTCAAAAGCAATACTGGACAACCAAGTGCAAATATATGGACACCTTGTTATTTTTCAAAGTGGGTAAATTTTATGAGCTGTATGAGCTCGATGCTGAAGTAGGACACAAGGAGCTTGACTGGAAGCTGACTGTCAGTGGTGTAGGGAAATGCAGACAGGTTGGTTGTCCAGAGAGCGGAATTGATGATGCAATTCAGAAGCTGGTCTCTCGAGGATACAAAGTTGGTCGGATGGAGCAGATTGAAACGGCTGAACAAGCACGTGCGAAGAGAGGAGATAAAGCTATGGTGCAGCGAGAGCTTGTTCAGATAATCACTCCATCTACAGTGATGGATGGAAACATCAAACCAGAGGCTGTCCACCTCCTGGCTCTCAAAGAGGAGGTACGAGAAGCTACGAATCCCATTCCTGGGAAGGCAGACAAATTGGTGATAGCTATTGGTTTCGCCTTTGTAGATGCTGCTGCTGGCCGCTTCTACGTGGGGTCACTGTCTGATGATACCTCTCTCATCAATCTGAAGACGCTGCTCACCCAGGTTGCGCCTCAAGAGGTACTTTACGAAAGCGGAGGCATATCGAAGGAATCTCTCCGAGCTCTGCGGAGGTTGTCTGCACCAGGTTTACTTCCTGTTACACTTACTCCTCTCCAACCTAGCTTGGAATTTATGGAAGCTGGTGACGCAATCCGCATGCTACGGTCAAATCGTTACTTCACGGATTCCTCGGACAATAATTGGCACACCGAAGGCGACGATTCGTGGCCAACAGCACTGAAATTGTCAGCAGATATTCCTCTGGCTACTTCCGCTCTGGGAGCTCTTGTTTCGCACTTGACCCGCATGAAGTGTGATGGTGAGCTTCTTCCAAACGGATTTCTCTGCCCATACGAAGTTTTTAAAGGCTCTCTGCGGTTGGATGGACAAACTGTTTCCAATTTAGAGCTTCTTGAAAATAGAGACGACGGTGGAAAAGCAGGGACCCTTTTTAATTATCTTGACAGTTGTGTCACGGGTTTTGGTAAGCGACTGCTTCGTCGCTGGATATGCCATCCTCTTCGGAATATTGGAGATATACATTACAGATTGGATGCTGTTGATGAGCTGAACTCCTGGCCAGAGATGACGTGTTCTCTAAGAGCAGGTTTGAGGAAACTACCTGACTTAGAGCGGCTAATTGCTCGAATTCGGAATCTCAGTTTTTCTCCTCTGGCAGGAATACCAACTGCGGCAAAGAAAGTACATCAGAGAAAGTTAAAAGCATTTTGCTCAACAGTTCTTGGAGTGCGCGCAGCTGGAGAACTACTTCTTTTAATCAACAATTTACGTTCTGATGGAAACATAGAGTTGAAGTCCAAATTTTTGCAGGCAGCTGCAGCACTACCATATCCAAAACCTGTTGAAGCTTGTCTGAAGGAACT
GGAGTTGGGGATAGATGCGAGCTGTAAAACATGCAGATTATCGAAATCACAGGAGGGTAACTATGTGGACGATGATGAGCAAGAAGACGAGTGGGAAGCAAGGAGACTGAGCCAGCTCATTGATATGTTTAATGAACACACCTCCTTTTGGGTGAAGCTTGTGGATACATTGGGCCAGCTAGATGTGTTGATATCTTTCGCTTCAACAATAAGTGCAGCAAATGGTCCCACATGCAGGCCTCAGTTGGTACCAAGTCCCTGTTTACCGAAAGGAGGCTCTGTCCTTCATATCGAAGGCCTATGGCATCCATATGCATCAGGAGGTCAGGATGGTGCTTTTGTACCTAATGACATTGAACTCGGCCCCGTTAACGCGTCAAGGATTGCTCCGCATGCTATGCTACTAACTGGTCCAAATATGGGAGGCAAGTCAACTCTTCTTCGAGCTACATGCTTGGCAGTAATCATGGCACAGTTGGGCTGCTACGTACCCGGTGAAGCATGCAAATTATCACCTGTGGACACCATCTTCACTAGATTGGGTGCGAGTGATCGCATCATGGCTGGGGAAAGCACCTTCATGGTTGAGTGCAACGAAGCAGCTTCTGTTTTACATCATGCAACAAGTGACTCACTCGTAATACTGGATGAGCTGGGCCGAGGAACATCCACTTTTGATGGCTACGCAATCGCTTATGCTGTATTTCACAGACTCGTCAACAGTCTTGACTGTCGCCTAATTTTTGCTACACACTATCATTCTCTTACTGAAGAATTTGCAACAAACCGTGACGTCAGCCTTCGTCACATGGCTTGCTCCTTCCAGAGCCGTAGTTCTGGGAAAGGTGGGCAGAACTCGTCAGAAAAGAGCGACTGCGATCTTGACAAGGAATTAGTGTTCCTGTACAAATTGACTGAGGGTGCCTCCCCCAAGAGTTTTGGGTTACAAGTAGCTCTACGGGCAGGTATTCCTGGTTCTGTTGTAAAAGCTGCTCATACCGCAGCTAATGCGATGCAGGAATCCCTCCCAGAAACATTTGTTTCAGGAGAAAATTCTAGTGGGTGTAGGTCAAATCAGAGGCAGTTGTTACAGACGATACTCCAGAGTGTCAACCATAAGATTGGAGTGGAAGCATGTCGTGAGAAGAATATTTCTTACGATCTACTTCGAAGTGCCTGGCAAAGTCTTCAGTGCAAAGCAGGAAGTGTCGAGTCCAGTGTTCATAGCGGTGCCGCAACCGCAATACGCAATACGCAAACCGCAAACCGCTGA
Protein:  
MSQQQSLFSFYSRGGAAAKPKANSHRSDFSVVKHEQREVAGCREGYHLEQGRKKLDSASQGQSLSTQNVLQSIEERFVRREKAVKPIGIVTTESRETSLDPTGNKRIRETEEKLVCPVPSIAQNQNLQKRRKLLEALGADDGVETTGAWAEARAKFEWMTPSKIRDAEKRRPCHPAYDERTLHIPHDVFHKFSASQKQYWTTKCKYMDTLLFFKVGKFYELYELDAEVGHKELDWKLTVSGVGKCRQVGCPESGIDDAIQKLVSRGYKVGRMEQIETAEQARAKRGDKAMVQRELVQIITPSTVMDGNIKPEAVHLLALKEEVREATNPIPGKADKLVIAIGFAFVDAAAGRFYVGSLSDDTSLINLKTLLTQVAPQEVLYESGGISKESLRALRRLSAPGLLPVTLTPLQPSLEFMEAGDAIRMLRSNRYFTDSSDNNWHTEGDDSWPTALKLSADIPLATSALGALVSHLTRMKCDGELLPNGFLCPYEVFKGSLRLDGQTVSNLELLENRDDGGKAGTLFNYLDSCVTGFGKRLLRRWICHPLRNIGDIHYRLDAVDELNSWPEMTCSLRAGLRKLPDLERLIARIRNLSFSPLAGIPTAAKKVHQRKLKAFCSTVLGVRAAGELLLLINNLRSDGNIELKSKFLQAAAALPYPKPVEACLKELELGIDASCKTCRLSKSQEGNYVDDDEQEDEWEARRLSQLIDMFNEHTSFWVKLVDTLGQLDVLISFASTISAANGPTCRPQLVPSPCLPKGGSVLHIEGLWHPYASGGQDGAFVPNDIELGPVNASRIAPHAMLLTGPNMGGKSTLLRATCLAVIMAQLGCYVPGEACKLSPVDTIFTRLGASDRIMAGESTFMVECNEAASVLHHATSDSLVILDELGRGTSTFDGYAIAYAVFHRLVNSLDCRLIFATHYHSLTEEFATNRDVSLRHMACSFQSRSSGKGGQNSSEKSDCDLDKELVFLYKLTEGASPKSFGLQVALRAGIPGSVVKAAHTAANAMQESLPETFVSGENSSGCRSNQRQLLQTILQSVNHKIGVEACREKNISYDLLRSAWQSLQCKAGSVESSVHSGAATAIRNTQTANR
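
The CDS and Protein fields above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not part of the original record, that assumes the standard genetic code (NCBI translation table 1); the record itself does not state which table applies. The function name `translate` is hypothetical. Whitespace is removed before translation because the CDS field is wrapped across two lines. Only the first and last codons of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), laid out in T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, dropping one trailing stop codon if present."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


first = translate("ATGTCGCAGCAACAGTCT")  # first six codons of the CDS field
last = translate("GCAAACCGCTGA")         # last four codons, including the stop
print(first, last)  # MSQQQS ANR
```

Both fragments match the start (MSQQQS) and end (ANR) of the Protein field. Passing the full CDS text, with its line break, to the same function would be the complete check.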